Partial least squares: a versatile tool for the analysis of high-dimensional genomic data
Abstract
Partial least squares (PLS) is an efficient statistical regression technique that is well suited to the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS and a host of its bioinformatics applications. In particular, we provide a systematic comparison of the PLS approaches currently employed and discuss analysis problems as diverse as tumor classification from transcriptome data, identification of relevant genes, survival analysis, and modeling of gene networks and transcription factor activities.
Similar resources
Spectrophotometric Simultaneous Kinetic Determination of Iodide and Iodate Using Partial Least-Squares Calibration Method in a Single Kinetic Run
A rapid, sensitive and versatile kinetic method is presented for the simultaneous spectrophotometric determination of iodide and iodate by partial least-squares regression (PLS) using original and derivate data named as absorbance and rate data. The method is based on the catalytic effect of the cited anions on the reaction rate between Ce(IV) and As(III) in 2 mol l⁻¹ sulfuric acid medium. The ...
Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data
Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...
Methods for regression analysis in high-dimensional data
By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...
An improved structure models to explain retention behavior of atmospheric nanoparticles
The quantitative structure-retention relationship (QSRR) of nanoparticles in roadside atmosphere against the comprehensive two-dimensional gas chromatography which was coupled to high-resolution time-of-flight mass spectrometry was studied. The genetic algorithm (GA) was employed to select the variables that resulted in the best-fitted models. After the variables were selected, the linear multi...
Designing a Commercialization Model for Research Achievements at a Military University Research Institute by Partial Least Squares Structural Equation Modeling
Background and Aim: Today, in universities and research institutes, the lack of attention to commercialization makes it impossible or difficult to enter the markets for technology and research products. Therefore, this study aims to design a commercialization model for the research achievements of a military research institute. Methods: This descriptive-analytic study was done in a cross-sectional ...
Journal:
- Briefings in bioinformatics
Volume 8, Issue 1
Pages: -
Publication date: 2007